Pattern-Responsive Lexicon Optimization
نویسندگان
چکیده
In this paper, we show that current interpretations of Lexicon Optimization (Prince and Smolensky 1993), in particular that of Archiphonemic Underspecification (Inkelas 1995), incorrectly predict the distribution of underspecification in lexical entries. We present cases from three vowel harmony languages in which speakers treat harmonic and disharmonic roots differently under reduplication. The assumption of full specification entails a ranking paradox, which can be resolved if underspecification is admitted in certain contexts not predicted by the principles of Lexicon Optimization. We point the way towards an expanded model of Lexicon Optimization that would both allow for and predict such cases of underspecification.
منابع مشابه
Searching Large Lexicons for Partially Specified Terms using Compressed Inverted Files
There are many advantages to be gained by storing the lexicon of a full text database in main memory. In this paper we describe how to use a compressed inverted file index to search such a lexicon for entries that match a pattern or partially specified term. This method provides an effective compromise between speed and space, running orders of magnitude faster than brute force search, but requ...
متن کاملOptical Character Recognition for Cursive Handwriting
ÐIn this paper, a new analytic scheme, which uses a sequence of segmentation and recognition algorithms, is proposed for offline cursive handwriting recognition problem. First, some global parameters, such as slant angle, baselines, and stroke width and height are estimated. Second, a segmentation method finds character segmentation paths by combining gray scale and binary information. Third, H...
متن کاملLexicon Optimization for Chinese Language Modeling
In this paper, we present an approach to lexicon optimization for Chinese language modeling. The method is an iterative procedure consisting of two phases, namely lexicon generation and lexicon pruning. In the first phase, we extract appropriate new words from a very large training corpus using statistical approaches. In the second phase, we prune the lexicon to a preset memory limitation using...
متن کاملBlocking effects at the lexicon/semantics interface and bi-directional optimization in French
We document two cases of partial blocking at the lexicon/semantics interface concerning the interpretation of inchoative French verbs (craquer 'snap', se briser 'break', (se) casser 'break'). One concerns the obligatorily referential interpretation with non-reflexive-marked verbs like casser in a sentence whose pronominal subject il is potentially ambiguous between a referential and a non-refer...
متن کاملLexicon optimization for dutch speech recognition in spoken document retrieval
In this paper, ongoing work concerning the language modelling and lexicon optimization of a Dutch speech recognition system for Spoken Document Retrieval is described: the collection and normalization of a training data set and the optimization of our recognition lexicon. Effects on lexical coverage of the amount of training data, of decompounding compound words and of different selection metho...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000